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DETECTION METHOD FOR C-RAF-1 GENES 

5 



Field of the Invention 

1 o The present invention relates to ( 1 ) a method of 

identifying an individual at an increased risk for 
developing cancer, (2) a method for determining a 
prognosis of patients afflicted with cancer, and (3) a 
method for determining the proper course of treatment for 

15 a patient afflicted with cancer. 

narrkground Information 

Lung cancer claims more lives in the United 

20 States than any other neoplasm (R.S. Finley, Am. Pharm. 
NS29, 39 (1989)), and of the various forms lung 
adenocarcinomas have one of the worst prognoses (T.P. 
Miller, semin. Oncol. 17, 11 (1990)). The incidence of 
adenocarcinoma of the lung (ACL) in the United States is 

25 also quickly rising (I. Linnoila, Hematol. Oncol. North. 
Am. 4, 1027 (1990); J.B. Sorensen, H.H. Hansen, Cancer 
Surviv. 8, 671 (1989)). In order to gain insight into 
this complex and deadly disease, a model system for its 
study has been developed. For such a model to provide 

30 clinically relevant data several criteria must be met. 

The tumors produced should be histologically equivalent to 
their human counterparts, tumor induction must be reliable 
and reproducible, and the numbers generated must be great 
enough to provide statistical significance. To satisfy 

35 these conditions a system has been created which uses two 
inbred mouse strains (NFS/n and AKR) along with 
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transplacental exposure to the potent carcinogen N-ethyl- 
N-nitrosourea (ENU) and promotion with the antioxidant 
butylated hydroxy toluene (BBS). The resulting tumors were 
examined for altered expression or structural mutations of 
5 genes implicated in lung tumor development such as ras^. 
mvc, and raf oncogenes (CD. Little et al. f Nature 306, 
194 (1983); P-E. Kiefer et al., Cancer Res,., 47, 6236 
(1987); E. Santos et al., Science 223, 661 (1984); S. 
Rodenhuis, N gngj - .t. Med. 317, 929 (1987); M. Barbacid, 
10 mc - Slim xnvst. 20, 225 (1990); O.K. Rapp et al. , i. 

xnt hssez* t ss . as sai&z ai feaag ^ cer - 4 * 162 (1988); 

M.J. Birrer et al., Ann, Rev. Med. 40, 305 (1989); G. 
Sithanandam et al. , Oncogene 4, 451 (1989)). 

15 £a£ proto-oncogenes are evolutionary highly 

conserved genes encoding cytoplasmic serine/ threonine 
specific kinases, which function in mitogen signal 
transduction (reviewed in U.R. Rapp et al., The Oncogene 
Handbook, T. Curran et al., Eds. (Elsevier Science 

20 Publishers, The Netherlands, 1988), pp. 115-154; O.R. 
Rapp, Oncogene. 6, 495 (1991)). The three known active 
members in the rM family encode phosphoproteins of 
similar size (72/74 kD for Raf-1; 68 kD for A-Raf-1 , and 
74 kD for B-Raf (O.R. Rapp et al., in R^ro yi ruses and 

25 Penology. R. Gallo et al., Eds. (Humana Press, 

Clifton, New Jersey 1985), PP- 449-472; T.W. Beck et al., 
SaclS^ ina. Res. 15, 595 (1987); G. Sithanandam et al., 
Oncogene 5, 1775 (1990))). Raf-1 was first identified as 
the cellular homologue of v-raf (H.W. Jansen et al., 

30 Nature 307, 218 (1984)), the transforming gene of 3611 MSV 
^Z~Rap P et al., ,T. Virol. 45, 914 (1983); U.R. Rapp et 
j rm- Say &Sa3 SSLJS& 80, 4218 (1983)). Amino 
acid comparisons of raf family genes shows three conserved 
regions [CR1 , CR2, CR3] (T.W. Beck et al., Nucleic Acids 

35 Res^ 15, 595 (1987)); CR1 is a regulatory region 

surrounding a Cys finger consensus seguence, CR2 is a 
serine/threonine rich region, and CR3 represents the 
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kinase domain. Raf-1 has been mapped to chromosome 3p25 
in humans (S.J. O'Brien et al., Science 223, 71 (1984)), 
and this region has been found to be frequently altered in 
small cell lung carcinoma (SCLC) (J. Whang-Peng et al., 
5 Cancer Genftfc. Cvtooenet. 6, 119 (1982); J.M. Ibson et al., 
J. Cell. Biochem. 33, 267 (1987)), familial renal cell 
carcinoma (A.J. Cohen et al., n. Engl. J. Med. 301, 592 
(1979); G. Kovacs et al., T"* - -- J - Cancer 40, 171 (1987)), 
mixed parotid gland tumors (J. Mark et al., Hereditas 96, 
10 141 (1982)), and ovarian cancer (K. Tanaka et al., Cancer 
Genet. Cvtooenet. 43, 1 (1989)). 

Raf genes are differentially expressed in 
various tissues (S.M. Storm et al., Oncogene 5, 345 

15 (1 990 ) ) . c -raf- 1 has been found to be expressed 

ubiquitously, though absolute levels vary between tissues. 
A-raf-1 is present predominantly in the urogenital 
tissues, whereas B-Raf is most abundant in cerebrum and 
testis. The ubiquitous c-Ra£-1 kinase is regulated by 

20 tyrosine and serine phosphorylations that result from 

activated growth factor receptor kinases (D.K. Morrison et 
al., Cell 58, 648 (1989); D.K. Morrison et al., Proc. 
Natl. Acad. Sci. OSA 85, 8855 (1989); K.S. Kovacina et 
al., .7. Biol. Chem. 265, 12115 (1990); P.J. Blackshear et 

25 al., -T. Biol. Chem. 265, 12131 (1990); M.P. Carroll et 

al., .7- Riol. Chem. 265, 19812 (1990); J.N. Siegel et al., 
.7. Biol. Chem. 265, 18472 (1990); B.C. Turner et al., 
Proc. Natl. Ar.afl. Sci. USA 88, 1227 (1991); M. Baccarini 
et al., EMBO J. 9, 3649 (1990); H. App et al., Mol. Cell. 

30 Biol. 11,- 913 (1991)). Raf-1 operates downstream of Ras 
in mitogen signal transduction as judged by experiments 
using antibody microinjection (M.R. Smith et al., Nature 
320, 540 (1986)), c-raf-1 antisense expression constructs 
(IT. Kolch et al., Mature 349, 426 (1991)), dominant 

35 negative mutants (W. Kolch et al., Nature 349, 426 

(1991)), and Raf revertant cells. Studies with NIH3T3 
cells and brain tissue demonstrated that mitogen treatment 
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induces Raf-1 kinase activity and causes a transitory 
relocation of the active enzyme from the cytoplasm to the 
nucleus and perinuclear area (Z. Olah et al., Kxp. Brain. 
Res . (in press); O.R. Rapp et al., in Col* Spring Harbor 
5 ^r^i* on Qu anttfeafcjvS Biology, Vol. LIU, Eds. (Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 
1988) pp. 173-184). 

Raf-1 coupling has been examined in more than a 
10 dozen receptor systems and all strong mitogens stimulated 

Raf-1 kinase activity (O.R. Rapp, Ojicooene 6, 495 (1991); 

dTI Morrison et al., Cell 58, 648 (1989); D.K. Morrison 

et al., rrn- — ™A 85, 8855 (1989); K.S. 

Kovacina et al., .T Biol. Chem. 265, 12115 (1990); P.J. 
15 Blackshear et al., J , BteL Chem.. 265, 12131 (1990); M.P. 

Carroll et al., - - 19812 (1990); J ' N * 

Siegel et al., fl BlflL Chem. 265, 18472 (1990); B.C. 

Turner et al., rrr - -~ lead S^ OSA 88, 1227 (1991); 

M. Baccarini et al., EMBO J. 9, 3649 (1990); H. App et 
20 al., MoT Cell. Biol. 11, 913 (1991)), and this 

stimulation correlated with an increase xn ga£-1 

phosphorylation leading to a shift in apparent molecular 

weight. 



25 



30 



35 



sommary of the invention 

It is an object of this invention to provide a 
method of identifying an individual at an increased risk 
for developing cancer. 

It is another object of this invention to 
provide a method for determining a prognosis in patients 
afflicted with cancer. 

It is a further object of this invention to 
provide a method for determining the proper course of 
treatment for a patient afflicted with cancer. 
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Further objects and advantages of the present 
invention will be clear from the description that follows. 

In one embodiment, the present invention relates 
5 to a method of identifying an individual at an increased 
risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
10 classifying an individual having one or more 

mutations in the region as having an increased risk for 
developing cancer. 

In another embodiment, the present invention 
15 relates to a method for determining a prognosis in 
patients afflicted with cancer, comprising: 
amplifying a region of the c-raf-1 gene; 
analyzing products of the amplification for evidence 
of mutation; and 
20 classifying patients having no mutation in said 

region as being less likely to suffer disease relapse or 
having an increased chance of survival than those patients 
having one or more mutations in said region. 

25 in a further embodiment, the present invention 

relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer, comprising: 
amplifying a region of the c-raf-1 gene; 
analyzing products of said amplification for evidence 

30 of mutation; 

identifying a patient having at least one mutation in 

said region, which patient may require treatment proper 

for patients having a lesser chance of survival or 

decreased time to relapse; and 
35 identifying a patient lacking mutations in said 

region, which patients may require treatment proper for 
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patients having a greater chance of survival or being less 
likely to suffer disease relapse. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 - Effect of BHT promotion on ENU 
tumorigenesis in NFS/n x AKR mice. The X-axis represents 
percent tumor induced mortality within each group, and the 
Y-axis reflects age in weeks. All animals were exposed to 

10 ENU transplacental^ at a dose of 0.5mM/Kg of mother's 
body weight on day 16 of gestation (presence of vaginal 
plug was scored as day one) . At two weeks of age mice 
were weaned into two separate groups and separated by sex. 
Both groups were housed in identical cages and supplied 

15 with food (Purina Lab Chow) and water ad libitum . 

Beginning at three weeks of age, group 2A (0) was given 
weekly intraperitoneal (i.p.) injection of com oil (0.1 
ml), and group 2B (o) received weekly i.p. injections of 
BHT (20 mg/Kg of body weight) dissolved in corn oil. 

20 Administration of BHT reduces the mean age of mortality 
from approximately 20 weeks to 13, and decreases the 
initial age of mortality. These curves are significantly 
different (ps 0.001) as judged by a 2-tailed Cox test. In 
both groups the rate of tumorigenesis was identical for 

25 males and females. 

Figure 2. Northern blot analysis of proto- 
oncogene expression levels in ENU induced tumors. 

30 Figure 3. Diagnostic digestion of PCR amplified 

Ki-ras genes from ENU induced tumors. Genomic DNA was 
isolated from a cesium chloride gradient during RNA 
preparations. In each case 10 ng was amplified via PCR 
(95°C, 5 min. followed by 35 cycles of 95*C, 1 mm. -» 

35 55-C, 1 min. -» 72», 1 min.) with 2 units of Tag I 
polymerase. The primers used (K1 ; 5'- 
AACTTGTGGTGGTTGGACCT— 3 ' -» (SEQ ID N0:6) and K2; <= 3'- 
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GTCTTAGTGAAACACCTACT-5 ' (SEQ ID NO:7)) generate a 79 b.p. 
product. The primer Kl ends at codon 12 and contains a 
mismatch from normal mouse (Ki-ras. sequence its 18th 
nucleotide (G •* C) creating a BstNI site (CCTGG) in 
5 conjunction with a normal codon 12 (GGT) . Digestion of 
amplified product from a normal allele with BstNI produces 
two products of 19 and 60 b.p., whereas a mutation in one 
of the first two positions of codon 12 will eliminate the 
BstNI site. The presence of two normal alleles results in 

10 all of the product being cleaved and the presence of one 
mutant and one normal allele will result in only half of 
the product cut. In the three panels each sample was run 
in duplicate, uncut and cut with BstNI. F1 is DNA from an 
untreated NFS/n X AKR F1 mouse, and MCA5 is a murine cell 

15 line known to harbor a mutant K-ras codon 12 allele. One 
lymphoma (24Ly) and one cell line (117; derived from a 
lung adenocarcinoma) display a mutated Ki-£aj. codon 12 
allele; however, 24Ly was a passaged tumor and examination 
of the original tumor showed two normal alleles indicating 

20 that this mutation was acquired during passaging. 

Figure 4. c-raf-1 RNAse protection analysis of 
ENU induced tumors. The probe used was a 32P labeled 
antisense transcript from the 3' non-coding region of a 

25 mouse c-raf-1 cDNA to the 3' most StuI site. 

Hybridization of this probe with normal RNA results in a 
protected fragment of 1.2kb covering the region encoding 
the Raf-1 kinase domain. One ug of poly (A) + RNA from each 
tumor and 5 ug of F1 RNA (in order to get comparable 

30 signals) was hybridized for 12 hours at 52'C with 200,000 
cpm of *P labeled mouse c-raf antisense transcript. 
Hybrids were then digested for 30 minutes with 25 ug RNAse 
A and 33 units of RNAse T1 at room temperature. Digested 
hybrids were then incubated with 50 ug of proteinase K, 

35 phenol/chloroform extracted, ethanol precipitated, and 
resuspended in loading dye containing 80% formamide. 
Samples were then run on 6% polyacrylamide denaturing 
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seguencing gels at 65 watts. Gels were vacuum dried at 80 
degrees C and exposed to X-ray film. Probe is ~ted 
probe alone; tRNA is probe hybridized to non-specific RNA; 
v-raf is probe hybridized to RNA from a v-raj: transformed 
5 cell line and the bands detected represent sin ^^ 6 
etches between murine c-raf and v-raf.; NFS/AKR F1 x 
probe hybridized with RNA from a normal (untreated) mouse, 
24 LY is probe hybridized with RNA from a lymphoma; and 
the remaining lanes are probe hybridized with RNA isolated 
10 from lung tumors. 

Figure 5. Schematic of Raf-1 protein showing 
sites of END induced mutations. CR1 # CR2, and CR3 
represent conserved regions 1 , 2 and 3. cDNAs were made 
15 from tumor derived poly(A)+ RNA using MoMuLV reverse 
transcriptase. Primers (MR1 sequence and MR2 sequence) 
encompassing a 435 base pair region c-sf were then used 
to amplify this region via PGR. The amplification mature 
W as then run on 1.7% agarose gels and the 435 bp product 
20 isolated. This isolated fragment was then treated with T4 
Polymerase and cloned into the Hindi site of 
sequencing. Another set of primers (EMR1 sequence and 
EMR2 sequence) was designed containing EcoRI sites at the 
termini and used to amplify a 609 base pair region 
25 (encompassing the original 435 base pair re ^° n) - 

isolated products from these reactions were then dxgested 
with EcoRI and cloned into the EcoRI site of KS. 
sequencing reactions were carried out using the Sequenase 
kit (USB) according to the recommended protocols for 
30 single and double stranded sequencing. Sequencing 

reactions were run on 6* polyacrylamide denaturing gels at 
65 watts. Gels were vacuum dried at 80 degrees c and 

, _.- Tn eac h case a normal allele was 

exposed to X-ray film, in eaca 

also sequenced along with the mutant allele. 



35 
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Figure 6. Schematic for Identifying c-raf-1 
mutations . Primers 1 and 2 are shown in SEQ ID NO: 8 and 
SEQ ID NO: 9, respectively. 

5 DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to methods that 
involve amplifying a region of the c-raf-1 gene (the 
sequence of a mouse c-raf-1 gene is shown in SEQ ID NO: 10; 
10 the nucleotide and corresponding amino acid sequence of a 
human c-raf-1 gene is shown in SEQ ID NO: 11 and SEQ ID 
NO: 12, respectively). 

In one embodiment, the present invention relates 

15 to a method of identifying an individual at an increased 
risk for developing cancer (preferably, lung cancer, T- 
cell lymphomas, renal cell carcinoma, ovarian carcinoma, 
and mixed parotid gland tumors) comprising: amplifying a 
region (preferably by using the polymerase chain reaction 

20 method(PCR) by cloning techniques) of the c-raf-1 gene 
of the individual (SEQ ID N0:11)(in one preferred 
embodiment, the region encodes amino acids 514 to 535 of 
SEQ ID NO: 12); analyzing products of the amplification for 
evidence of mutation (preferably by DNA sequencing of the 

25 region) and classifying an individual having one or more 
mutations in the region as having an increased risk for 
developing cancer. In one prefered embodiment, the region 
encodes amino acids 500 to 550 of SEQ ID NO: 12 or amino 
acids 450 to 630 of SEQ ID NO: 12. In another prefered 

30 embodiment, the PCR method employs a primer comprising the 
sequence shown in SEQ ID NO: 7 and a primer comprising the 
sequence shown in SEQ ID NO: 8. In another prefered 
embodiment, the method comprises the steps shown in Figure 

35 in another embodiment, the present invention 

relates to a method for determining a prognosis in a 
patient afflicted with cancer (preferably, those cancers 
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listed above). The method comprises: amplifying the 
region of the c-raf-1 gene as described above; analyzing 
products of the amplification for evidence of mutation as 
described above; and classifying a patient having no 
5 mutation in the region as being less likely to suffer 

disease relapse or having an increased chance of survival 
than a patient having one or more mutations in the region. 

in another embodiment, the present invention 
10 relates to a method for determining the proper course of 
treatment for a patient afflicted with cancer (preferably, 
those cancers listed above), comprising: amplifying a 
region (described above) of the c-raf-1 gene as described 
above; analyzing products of the amplification for 
15 evidence of mutation as described above; identifying a 

patient having at least one mutation in the region, which 
patient may require treatment proper for patients having a 
lesser chance of survival or decreased time to relapse; 
and identifying a patient lacking mutations in the region, 
20 which patients may require treatment proper for patients 
having a greater chance of survival or being less likely 
to suffer disease relapse. 

Administration of therapeutic agents (cytotoxic 
25 or cytostatic) tailored to recognize the mutant Raf-1 
protein but not normal Raf-1 could specifically target 
tumor cells for death of growth inhibition. Such agents 
could be comprised of cytotoxic T-cells, antibodies, 
and/or specifically designed chemical compounds. 

30 

The following Examples demonstrate consistent 
point mutations of the c-raf-1 proto-oncogene, within a 
small region of the kinase domain, in a mouse model for 
chemical tumor induction. This is the first demonstration 
35 of point mutated raf genes ia visa, and the first 

isolation of activating in vivo point mutations in the 
kinase domain of a proto-oncogene. The tumors examined 
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show a selective specificity for Raf-1 mutations as 
another family of genes, the ras proto-oncogenes which are 
frequently activated by point mutation in both animal and 
human tumors (S. Rodenhuis et al., Am. Rev. Resoir. Pis. 
5 142, S27-30; T.R. Devereux et al., Carcinogenesis 12, 299 
(1991)), is not involved. 

The present invention is described in further 
detail in the following non-limiting examples. 

10 

EXAMPLES 

The following protocols and experimental details 
are referenced in the examples that follow: 

15 RNA Isolation. Tumors were excised, a small portion 
minced in PBS (phosphate buffered saline solution) for 
passaging in nude mice, frozen immediately in a dry 
ice/ethanol bath, and stored at -70° until RNA extraction. 
Frozen tissues were minced on wet ice in a guanidine 

20 thiodyanate buffer (4M guanidine thiocyanate 10mM EDTA, 2% 
N-lauryl sarcosine, 2% beta-mercaptoethanol, lOmM Tris 
(pH=7.6)), disrupted in a Dounce homogenizer, and 
extracted three times with phenol: chloroform: isoarayl 
alcohol (24:24:2). Supernatants were then transferred to 

25 SW41 tubes, 100 jig of cesium chloride per ml was added to 
the supernatant which was then underlayed with one half 
saturated cesium chloride in 10mM EDTA (pH=7.0; index of 
refraction 1.3995-1.4000), and centrifuged at 25,000 rpm 
for 20 hours in a Sorvall SW-41TI rotor using a Beckman 

30 model L5-50 ultracentrifuge. Supernatants were removed 
and RNA pellets dissolved in 4 ml resuspension buffer (10 
mM Tris-HCl pH=7.6, 5% beta-mercaptoethanol, 0.5% N-lauryl 
sarcosine, 10 mM EDTA), extracted once with 
phenol: chloroform :isoamyl alcohol, sodium acetate added to 

35 0.1 2M and RNA precipatated with two volumes ethanol at - 
20°C overnight. Precipitates were centrifuged at 9,000 rpm 
in a Sorvall SS-34 rotor for 30 minutes, and pellets 
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redissolved in RNA sample buffer (10 mM Tris P H=7.4, M 
EDTA, 0.05% sodium dodecyl sulfate) and concentrations 
determined by absorbance at 260 nm. Poly (A)* RNA was 
isolated by binding to oligo dT cellulose columns in high 
salt (10mM Tris pH=7.4, ImM EDTA, 0.05% SDS, 500mM NaCi), 
and eluting with RNA sample buffer heated to 40°C. 

Seamen, Rlnfctinq. 5 yg poly(A)' RNA per lane was ethanol 
precipitated, desiccated, resuspended in loading buffer 
(20mM MOPS P H=6.8, 5mM sodium acetate, ImM EDTA, 50% 
formamide, 6% formaldehyde), heated at 65'C for 5 min., 
quick chilled on wet ice for 10 min., and electrophoresed 
through a 0.7% agarose gel containing 2.2 M formaldehyde, 
20mM MOPS [pH=6.8], 5mM sodium acetate, and ImM EDTA. 
Gels were then blotted overnight onto nitrocellulose 
filters via capillary transfer in 20X SSC, filters were 
washed in 3X SSC for 10 min. and baked at 80«C for 2 
hours. 

^■M^fcions. Filters were prehybridized at 42»C in 5X 
SSC, 50% formamide, 20mM sodium phosphate P H=6.8, 200 
ug/ml PVP-40, 200 yg/1 fi«U 400, 200 ug/ml bovine serum 
albumin, and 200 ug/ml sonicated sheared salmon sperm DNA. 
Blots were then hybridized with 500,000 cpm/ml of random 
25 primed *P labeled probes overnight at 42°C in 

prehybridization solution with 5% dextran sulfate. Blots 
were washed with agitation in 2X SSC, 0.1% SDS at room 
temperature six times for 20 minutes each wash, then 
washed once at 45»C in 0.1X SSC for 15 minutes. Filters 
30 were exposed to X-AR 5 film at -70°C. 



20 



35 



EXAMPLE 1 
Tumor Induction 



NFS female mice were mated with AKR males and 
pregnant females given a transplacental injection of 1- 
ethyl-1-nitrosourea (END) at a dosage of 0.5 mM/Kg 
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mother's body weight on day 16 of gestation, counting plug 
date as day one. ENU was chosen for tumor induction since 
it is a very potent direct acting carcinogen capable of 
modifying any base in vivo (Singer, B. et al., 1983. 

5 Molecular Bioloov of Mutagens and Carcinogens, Plenum 
Press, New York). ENU alkylates all tissues with roughly 
the same efficiency (E. Scherer et al., Cancer Lett. 46, 
21 (1989)) and has a very short half life in vivo (E.M. 
Faustman et al., Teratology 40, 199 (1989)) allowing 

10 specific mutagenesis of tissues which are mitotically 
active at a particular time. NFS and AKR were chosen as 
parental strains based on earlier studies which showed 
them to be particularly susceptible to ling tumors 
following ENU exposure (B.A. Diwan et al., Cancer Res. 34, 

15 764 (1974); S.L. Kauffman, JNCI 57, 821 (1976)). With 
this procedure nearly 100% of the offspring develop lung 
adenocarcinomas and approximately 70% develop, in 
addition, T-cell lymphomas with a mean latency of 
approximately 20 weeks. In order to achieve more rapid 

20 tumor development, weanling mice were treated with weekly 
injections of a tumor promoter, the antioxidant butylated 
hydroxy toluene or BHT (20mg/kg body weight dissolved in 
corn oil). BHT was used as it has been demonstrated to 
cause lung, lesions and hyperplasia when injected into mice 

25 (A. A. Marino et al., Prnn. Soc. Exp. Biol. Med. 140, 122 
(1972); H. Witschi et al., Prnc. Soc. Kxp. Biol. Med., 
147, 690 (1974); N. Ito et al., CRC Crit. Rev. Toxicol. 
15, 109 (1984)). In the present system it nearly doubles 
the rate at which tumors develop. Figure 1 compares tumor 
1 30 induced mortality with age of animals for those receiving 
ENU alone, and those receiving ENU and promoted with BHT. 
These curves demonstrate that when BHT is given the mean 
age of tumor induced mortality decreases from 
approximately 20 weeks to around 12, and there is also a 

35 decrease in initial latency. These curves are 

significantly different with a confidence limit greater 
than 99.99% using a 2-tailed Cox test. In addition, BHT 
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promotion, while increasing the rate at which tumors 
develop, does not affect the tumor spectrum. 



25 



30 



KXAMPLE 2 
nnnnaen^ Repression 



Northern blot analysis revealed elevated levels 
of c-raf-1, as compared to normal tissue, in every tumor 
examined (Figure 2), and Western blot analysis showed that 

10 protein levels correlated with message levels (0 R Rapp 
et al., in On^oa^^ancer, S.A. Aaronson et al 
Eds. (Tokyo/VNO Scientific Press, Tokyo, 1987) pp. 55-74). 
in addition, in cell lines derived from primary tumors 
Raf-1 protein kinase activity was shown by immune-complex 

15 kinase assays to be constitutive. Further analysis of 
other oncogenes revealed no consistent pattern of 
expression except for ras and myc family genes. In the 
case of the myc family, one member (either c-, N-, or L- 
KE) was overexpressed but never more than one. For the 

20 ras genes, at least one member (Ki-, Ha-, or N-r«yD, and 
o^en more than one, was expressed at high levels when 
compared with the normal tissue. In addition all 
oncogenes examined via Northern analysis exhxb.ted full 
length, normal sized transcripts. 

ras genes were considered likely candidates for 
national activation since oncogenic forms of Ki-r^ have 
previously been observed in lung tumors (S. Rodenhuxs et 

i-> oqq (1991)) and ENU xs a point 
mutagen (Singer, B. et al., 1983. Modular Biology of 
TTllt -^„ ^ r^inoaens. Plenum Press; Hew York). A 
systematic analysis of various ras codons known to be 
involved in oncogenic activation was therefore performed . 
Ha- Ki-, and N— ras were examined at codons 12, 13, and 
61 for potential mutations via RNAse protection assays 
(R.M. Myers et al., 230, 1242 (1985); E. Winter et 



35 
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al., Prnr Natl. Acad , Sci . USA 82, 7575 (1985)), PCR 
amplification followed by subsequent sequencing (F. Sanger 
et al., .7 Mnl. Biol. 13, 373 (1975)), and PCR 
amplification followed by diagnostic restriction digests 
5 (W. Jiang et al., oncogene 4, 923 (1989)). PCR 

amplification creating diagnostic enzyme sites is a very 
efficient way of examining alleles for mutations at known 
sites and involves designing a PCR primer whose 3' end 
lies next to and produces a novel restriction site 
10 encompassing the codon of interest. Following 

amplification, PCR products from normal alleles will 
contain the new restriction site, while mutant alleles 
will not. Digestion of the product from tissue with two 
normal alleles results in all product being cut; however, 
15 if one allele contains a mutation, only half of the 

product will be digested. Figure 3 shows the results of 
amplification and diagnostic digestion applied to Ki-ras. 
codon 12 in several tumors and cell lines. The first 
panel is from a set of lymphomas. F1 is DNA from a normal 
20 untreated mouse and both alleles are cut by BstNl, 

indicating the presence of two normal alleles. MCA5 is a 
murine cell line known to contain a Ki-ras codon 1 2 
mutation (L.F. Parada et al., t Cell. Biol. 3, 2298 
(1983)), and only the amplified normal allele is cleaved. 
25 Of the five tumors shown in the second panel, one shows a 
mutant Ki-£gs allele. The next panel shows some of the 
lung tumors tested and none of them exhibit a mutant 
allele, and the final panel shows tumor derived cell 
lines. The first three are from lymphomas and the last 
30 three from lung adenocarcinomas. One lung tumor line 
<#117) has a Ki-ras 12 mutation that was not present in 
the primary tumor but came up upon transplantation. This 
analysis has been performed with Ki, Ha and N-ras genes at 
codons 12 and 61 . Of all the tumors and cell lines 
35 examined by this method for mutations of the three ras 
genes at codons 12 and 61, the two shown here were the 
only ones detected. Examination of codon 13 was done by 
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PCR amplification of genomic DHA surrounding codon 13 
followed by cloning into KS+ (Stratagene) and double 
stranded sequencing. Table I summarizes the r^ mutation 
data. The most notable point from this table is the 
5 conspicuous lack of ras mutations in these tumors. In 
fact the number of ras mutations is much lower than would 
be expected for a sampling of spontaneous tumors <S. 
Rodenhuis et al., Bm Bgv Respir. Pis. 142, S27-30; T.R. 
Devereux et al„ Carciiicaengsis 12, 299 (1991 ); J.L. Bos, 
10 CancerRes^ 49, 4682 (1989)). Having eliminated ras genes 
as playing a primary role in the genesis of these END 
induced tumors, c-raf-1 was investigated for possible 
small or point mutations. 

15 TABLE I 

— — .n* r»n Lin e- fnr ras Mutations 

Codon 12 Codon 13 Codon 61 



20 Ha-ras 
Ki/ras 
N-ras 



Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


Tumors 


Cell 
Lines 


0/10 


0/6 


0/6 


0/2 


0/10 


0/6 


i7io 


1/6 


0/6 


0/2 


0/6 


0/2 


0/10 


0/6 


0/6 


0/2 


0/10 


0/6 



• This was a second passage tumor in which the original 
25 tumor did not contain a Ki-ras mutation. 

Table I: Summary of mutation analysis for Ha-, K1-, and 
N-ras. at codons 12, 13, and 61. Each box displays the 
number of mutations detected, over the number of tumors 
30 and tumor derived cell lines examined via RNAse 

protection, sequencing or diagnostic digestion, for each 
of the nine codons. 

BY AMPLE 3 

35 BafcafciSBg, -in Raf-1 

Since no point mutations had been described 
for raf genes in vivo, as had been for the ras genes (E. 
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Santos et al., Science 223, 661 (1984); S. Rodenhuis, IL_ 
Rnal. J. Med. 317, 929 (1987); M. Barbacid, Eur. J. 
Clin. Invest. 20, 225 (1990); F. Sanger et al., J. Mol. 
Biol. 13, 373 (1975)), point mutations were screened for 

5 using RNAse protection assays (R.M. Myers et al., 

Science 230, 1242 (1985); E. Winter et al., Proc. Natl. 
Acad. Sci. USA 82, 7575 (1985)). Figure 4 shows a 
typical protection assay using a c-raf-1 probe. In this 
experiment the probe used covered the 3' end of raf-1 

10 from the 3' most StuI site to the end of the coding 

sequence. The first lane is a marker (pBR322 digested 
with Haelll), the second shows the probe alone 
undigested, the third lane shows the probe hybridized to 
unrelated RNA in this case tRNA, the fourth lane shows 

15 hybridization with v-raf transformed cells and the lower 
bands represent cleavage at points where the mouse c- 
raf- 1 gene differs from v-raf- Tne fifth lane shows 
hybridization with RNA isolated from a normal lung of an 
untreated F1 mouse, the next lanes are RNA isolated from 

20 several tumors. In the case of the normal RNA, only 

one, fully protected, band is detected while in the case 
of the tumors two major bands are seen after digestion. 
20 out of 20 tumors analyzed in this fashion showed this 
extra band. These data demonstrate the following major 

25 points: 1) there is a tumor specific alteration in c- 
raf- 1 that results in a region of non-homology 
recognizable by either RNAse A or T1 ; 2) The alterations 
are confined to the same region of one allele as two 
bands of equal size are present in the tumor lanes, and; 

30 3) both alleles were expressed at comparable levels as 
both bands are of approximately equal intensity. In the 
assay shown 5 ug of poly (A) + RNA was hybridized from 
normal tissue, and 1 ug was used from the tumors. This 
was necessary to get signals that could be compared on 

35 the same gel due to the overexpression of c-raf-1 in the 
tumors. By running these assays with various markers it 
was possible to estimate the approximate site of the 
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alteration(s) to be in the vicinity of the exon 14/exon 
15 junction. In order to define the precise genetic 
alteration or alterations, PGR primers were designed 
which would generate a 600 bp fragment encompassing thxs 
5 region. cDNAs from tumor derived RNA were then 

amplified and cloned into KS + (Stratagene) for double 
stranded sequencing. The sequencing results from 
several tumors are shown in Figure 5. The top portion 
of Figure 5 presents a cartoon of the mouse Raf-1 
n0 protein. There are three conserved regions CR1 CR2 and 
CB3 with CR3 representing the kinase domaxn. The probe 
used in the RNAse protection assays covers the xndxcated 
area, and the PCR primers amplified the bracketed 
region. Sequencing through this area revealed a varxety 
15 of mutations just downstream of the APE site These 
mutants are shown in an expanded version at the bottom 
of Figure 5 (See also SEQ ID N0:1 for normal mouse 
sequence and SEQ ID NO: 2, 3, 4, and 5 for mutant 
sequences). These mutants were isolated *~ 
separate tumors, and in each case a normal allele (SEQ 
ID N0:1) was also sequenced. Repeating the cDNA 
synthesis, PCR amplification, cloning and seguencxng 
gives the same sequence and normal tissue shows no 
stations demonstrating that these alterations are not 
artifactual. Sequence covering the amplifxed regxon has 
been examined and it is interesting that all of these 
changes occur within a very small region of the raf 
protein. In fact the region where these mutatxons occur 
overlaps an epitope shared by monoclonal antibodxes 
generated against raf (W. Kolch et al. , Qscoo^ 5, 713 
(1990)), and computer modeling of the protexn shows thxs 
to be a hydrophilic domaxn, the structure of which xs 
predicted to be altered by these mutations. Thxs 
indicates a biologically important region for the 
molecule and indeed the first of these mutatxon tested 
in NIH3T3 cell assays, after cloning into a retroviral 
expression vector (El-neo, (G. Heidecker et al., MoU 



20 



25 



30 



35 
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Cell. Biol, 10, 2503 (1990))), was found to be weakly 
transforming when driven by a Moloney LTR. The 
transformation efficiency was comparable to EC2, a 
previously characterized mutation of human c-raf-1 cDNA 
5 (G. Heidecker et al., Mol. Cell. Biol. 10, 2503 (1990); 
C. Wasylyk et al., Mol. Cell. Biol. 9, 2247 (1989)) and 
-20 fold lower than the v-raf oncogene. 

★ * * * * 
10 All publications mentioned hereinabove are 

hereby incorporated in their entirety by reference. 

While the foregoing invention has been 
described in some detail for purposes of clarity and 
15 understanding, it will be appreciated by one skilled in 
the art from a reading of this disclosure that various 
changes in form and detail can be made without departing 
from the true scope of the invention and appended 
claims . 



WO 93/06248 



PCT/US92/07817 



-20- 

SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) APPLICANT: Rapp, Ulf R. 

Storm, Stephen H. 

(ii) TITLE OF INVENTION: DETECTION METHOD FOR C-RAF-1 GENES 
(iii) NUMBER OF SEQUENCES: 12 

fivl CORRESPONDENCE ADDRESS: _ „ „ 

1 } (A) ADDRESSEE: NEEDLE & ROSENBERG, P.C. 

(B) STREET: Suite 400, 133 Carnegie Way, N.W. 

(C) CIH: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: USA 

(F) ZIP: 30303-1031 

fv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Perryman, David G. 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414.0421 

fixl TELECOMMUNICATION INFORMATION: 
OXJ ™±om EpHMEs (4Q4) fi88 _ 0 7 70 

(B) TELEFAX: (404) 688-9880 



(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly 
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Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 . 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 190 

leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 • 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 
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Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 330 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 J3U 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 

355 360 
His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 
Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 
Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 



Leu His Val Gin Glu Thr Lys rne u.n neu r..* «... «~ 
435 440 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 

450 * 55 
He His Ar, Asp Met Lys Ser Asn Asn He Phe Leu His Glu G1, leu 
465 470 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 



500 



Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 

515 520 
Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 

530 535 
Glu Leu Pro Tyr Ala His lie Asn Asn Arg Asp Gin He He Phe Met 
545 550 

Val G1y Ar, 61, Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 



565 



Cys Pro Lys Ale Met Lys Arg Leu Val Ala Asp Cys Val lys lys Val 
580 003 
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Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 

Met Glu His He Gin Gly Ala Trp Lys Thr lie Ser Asn Gly Phe Gly 
1 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 ■ 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys AU Arg Leu 
100 105 110 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
v 115 120 125 

Aso Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
V 130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 

180 185 
Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 ZZ 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 

260 265 
Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 2°^ 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
2 go 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 



Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 

Thr Val Tyr 
35I "" ' ' *>Q 365 



340 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 



His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 aou 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 



Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn lie 
450 455 
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Ile His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val Val Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 " 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
I 5 10 15 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 
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Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 ou 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 

100 105 
Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 



115 



Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 135 
Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 X33 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His g Cys 
165 * 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 fcua 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 

210 215 
Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 " 3 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 



260 



Asp Ser Arg Met He Glu Asp Ala lie Arg Ser His Ser Glu Ser Ala 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 

290 295 
Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Gly Ser Gly Thr Gin Gin Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 ^ u 
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Asd Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He G1y Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 * 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 
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Leu His Arg Ala Ala His Thr Gin Asp He Asn Ala Cys Thr Leu Thr 

Thr Ser Pro Arg Leu Pro Val Phe 
645 

(2) IKFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(x1) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Glu His lie Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 u 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 

40 •» 



35 



Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 



50 55 60 



Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 



100 



Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 

115 12° 
Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 

130 135 1 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 ISO 133 

As* Gly Phe Ar, Cys Gin Thr Cys Gly Tyr Lys Phe His Glu Bis Cys 
165 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn He Arg Gin 
180 185 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
ig5 ZOO fcua 
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Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met lie Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

His Gly Asp Val Ala Val Lys He Leu Lys. Val Val Asp Pro Thr Pro 
370 .375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

Jle His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 
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Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 3AU 

Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Ser Gin 
515 520 s " 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 MU 

Glu Leu Pro Tyr Ala His lie Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 
565 5/Q 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 35 * u 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
1 595 600 60b 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 

610 615 
Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 630 635 

Thr Ser Pro Arg Leu Pro VaT Phe 
645 

(2) INFORMATION FOR SEQ ID NQ:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:S: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 

Leu Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 H3 

Thr Asp Ser Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 

50 55 
Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 " 70 75 
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Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu Gin Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
K 115 120 125 

Asp Phe Leu Asp His Val Pro He Thr Thr His Asn Phe Ala Arg Lys 
130 135 HO 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys" 
165 170 175 

Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 185 190 

Leu Leu Leu Phe Pro Asn Ser Thr Val Gly Asp Ser Gly Val Pro Ala 
195 200 205 

Pro Pro Ser Phe Pro Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 220 

Pro Ala Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 240 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 250 255 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu His Val 
260 265 270 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

Gly Ser Gly Thr Gin Gin Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 
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Hts Gly Asp Val Ala Val Lys He Leu Lys Val Vail Asp Pro Thr Pro 
370 375 380 

Glu Gin Leu Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 



4Z0 



425 



Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

He His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 



465 



470 



475 



Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 



485 



490 



Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 



500 



505 



Ala Pro Glu Val He Arg Met Gin Asp Asp Asn Pro Phe Ser Phe Gin 
515 520 525 

Ser Thr Cys Thr Phe Tyr Gly He Val Leu Tyr Glu Leu Met Ala Gly 
530 535 540 

Glu Leu Pro Tyr Ala His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Arg Leu Tyr Lys Asn 



565 



570 



Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 



580 



585 



Lys Glu Glu Arg Pro Leu Phe Pro Gin lie Leu Ser Ser He Glu Leu 

— cnn 605 



595 



600 



Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 oZQ 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 



625 



630 



635 



Thr Ser Pro Arg Leu Pro Val Phe 
645 
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(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

AACTTGTGGT GGTTGGACCT 20 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:7: 
TCATCCACAA AGTGATTCTG 20 

(2) INFORMATION FOR SEQ ID N0:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: . 
AGGAGACCAA GTTTCAGATG 20 

(2) INFORMATION FOR SEQ ID N0:9: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 
GCGTGCAAGC ATTGATATCC 20 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1947 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



ATGGAGCACA 


TACAGGGAGC 


TTGGAAGACG ATCAGCAATG 


GCTTTGGACT 


CAAAGATGCG 


60 


GTGTTTGATG 


GCTCCAGCTG 


CATCTCCCCT ACCATTGTTC 


AGCAGTTTGG 


CTATCAGCGC 


120 


CGGGCCTCAG 


ATGATGGCAA 


GCTCACGGAT TCTTCTAAGA 


CAAGCAATAC 


TATCCGGGTT 


180 


TTCTTGCCGA 


ATAAGCAAAG 


GACTGTGGTC AATGTGCGGA 


ATGGAATGAG 


CTTACATGAC 


240 


TGCCTTATGA 


AAGCTCTGAA 


GGTGAGAGGC CTGCAGCCAG 


AGTGCTGTGC 


AGTGTTCAGA 


300 


CTTCTCCAGG 


AACACAAAGG 


TAAGAAAGCA CGCTTAGATT 


GGAACACCGA 


TGCCGCCTCT 


360 


CTGATTGGAG 


AAGAACTGCA 


AGT6GATTTT TTGGATCATG 


TTCCCATCAC 


AACTCACAAC 


420 


TTTGCTCGGA 


AAAC6TTCCT 


GAAGCTTGCA TTCTGTGACA 


TCTGTCAGAA 


GTTCCTGCTA 


480 


AATGGATTTC 


GATGTCAGAC 


TTGTGGCTAC AAGTTTCATG 


AGCACTGTAG 


CACCAAAGTA 


540 


CCTACTATGT 


GTGTGGACTG 


GAGTAATATC AGACAGCTCT 


TGCTGTTTCC 


AAATTCCACT 


600 


GTTGGTGACA 


GTGGAGTCCC 


AGCACCACCT TCTTTCCCAA 


TGCGTCGGAT 


GCGAGAATCT 


660 


GTTTCCCGGA 


TGCCTGCTAG 


TTCCCAGCAC AGATACTCTA 


CACCCCATGC 


CTTCACTTTC 


720 


AACACCTCCA 


GCCCTTCCTC 


AGAAGGTTCC CTCTCCCAGA 


GGCAGAGGTC 


AACGTCCACT 


780 


CCCAATGTCC 


ACATGGTCAG 


CACCACCCTG CATGTGGACA 


GCAGGATGAT 


TGAGGATGCA 


840 


ATTCGAAGTC 


ACAGTGAATC 


AGCCTCACCT TCAGCCCTGT 


CCAGCAGCCC 


AAACAACCTG 


900 


GGTCCAACAG 


GCTGGTCACA 


GCCCAAAACC CCCGTGCCAG 


CACAAAGAGA 


GCGGGCACCA 


960 


GGATCTGGGA 


CCCAGCAAAA 


AAACAAAATT AGGCCTCGTG 


GGCAGAGAGA 


CTCGAGTTAT 


1020 


TACTGGGAAA 


TAGAAGCCAG 


TGAGGTGATG CTGTCTACTC 


GGATCGGGTC 


AGGTTCCTTT 


1080 


GGCACTGTGT 


ACAAGGGCAA 


GTGGCATGGA GATGTTGCAG 


TAAA6ATCCT 


AAAGGTGGTT 


1140 


GACCCAACTC 


CAGAGCAACT 


TCAGGCCTTC AGGAACGAGG 


TGGCTGTTTT 


GCGCAAAACA 


1200 


CGGCATGTTA 


ACATCCTGCT 


GTTCATGGGG TACATGACAA 


AGGACAACCT 


GGCGATTGTG 


1260 


ACTCAGTGGT 


GTGAAGGCAG 


CAGTCTCTAC AAACACCTGC 


ATGTCCAGGA 


GACCAAATTC 


1320 


CAGATGTTCC 


AGCTAATTGA 


CATTGCCCGA CAGACAGCTC 


AGGGAATGGA 


CTATTTGCAT 


1380 


GCAAAGAACA 


TCATCCACA6 


AGACATGAAA TCCAACAATA 


TATTTCTCCA 


TGAAGGCCTC 


1440 


ACGGTGAAAA 


TTGGAGATTT 


TGGTTTGGCA ACAGTGAAGT 


CACGCTGGAG 


TGGTTCTCAG 


1500 


CAGGTTGAAC 


AGCCCACTGG 


CTCTGTGCT6 TGGAT6GCCC 


CAGAAGTAAT 


CCGGATGCAG 


1560 
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GATGACAACC CGTTCAGCTT CCAGTCCGAC GTGTACTCGT ACGGCATCGT GCTGTACGAG 1620 

CTGATGGCTG GGGAGCTTCC CTACGCCCAC ATCAACAACC GAGACCAGAT CATCTTCATG 1680 

GTAGGCCGTG GGTATGCATC CCCTGATCTC AGCAGGCTCT ACAAGAACTG CCCCAAGGCA 1740 

ATGAAGAGGT TGGTGGCT6A CTGTGT6AAG AAAGTCAAAG AAGAGAGACC TTTGTTTCCC 1800 

CAGATCCTGT CTTCCATC6A GCTGCTTCAG CACTCTCTGC CGAAAATCAA CAGGAGCGCC 1860 

TCTGAGCCTT CCCTGCATCG GGCAGCTCAC ACTGAGGACA TCAATGCTTG CACGCTGACT 1920 

ACATCCCCAA GGCTACCAGT CTTCTAG . 1947 

(2) INFORMATION FOR SEQ ID N0:11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1947 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(0) TOPOLOGY: linear 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..1944 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

ATG GAG CAC ATA CAG GGA GCT TGG AAG ACG ATC AGC AAT GGT TTT GGA 48 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
1 5 10 15 

TTC AAA GAT GCC GTG TTT GAT GGC TCC AGC TGC ATC TCT CCT ACA ATA 96 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

GTT CAG CAG TTT GGC TAT CAG CGC CGG GCA TCA GAT GAT GGC AAA CTC 144 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 40 45 

ACA GAT CCT TCT AAG ACA AGC AAC ACT ATC CGT GTT TTC TTG CCG AAC 192 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 55 60 

AAG CAA AGA ACA GTG GTC AAT GTG CGA AAT GGA ATG AGC TTG CAT GAC 240 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 
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TGC CTT AT6 
Cys Leu Met 

GCA GTG TTC 
Ala Val Phe 



AAA GCA CTC AAG 

Lys Ala Leu Lys 
85 

AGA CTT CTC CAC 
Leu His 



Arg Leu 
100 



GAT TGG AAT ACT GAT 

Asp Trp Asn Thr Asp 
115 

GAT TTC CTG GAT CAT 

Asp Phe Leu Asp His 
130 

ACG TTC CTG AAG CTT 
Lys Leu 



GCT GCG 
Ala Ala 
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GTG AGG GGC CTG 

Val Arg Gly Leu 
90 

GAA CAC AAA GGT 

Glu His Lys Gly 
105 

TCT TTG ATT GGA 
He Gly 



Ser Leu 
120 



Thr Phe Leu 
145 

AAT GGA TTT 
Asn Gly Phe 

AGC ACC AAA 
Ser Thr Lys 



GTT CCC CTC ACA 

Val Pro Leu Thr 
135 

GCC TTC TGT GAC 
Cys Asp 



ACA CAC 
Thr His 



CAA CCA GAG TGC TGT 288 

Gin Pro Glu Cys Cys 
95 

AAA AAA GCA CGC TTA 336 

Lys Lys Ala Arg Leu 
110 

GAA GAA CTT CAA GTA 384 

Glu Glu Leu Gin Val 
125 

AAC TTT GCT CGG AAG 432 

Asn Phe Ala Arg Lys 
140 



Ala Phe 
150 



ATC TGT CAG AAA TTC CTG CTC 480 
Gin Lys Phe 



He Cys 
155 



Leu Leu 
160 



CGA TGT CAG ACT 

Arg Cys Gin Thr 
165 

GTA CCT ACT ATG 
Thr Met 



TGT GGC 
Cys Gly 



TAC AAA TTT CAT GAG CAC TGT 528 
Phe His Glu 



Tyr Lys 
170 



Val Pro 
180 



CTC TTA TTG TTT CCA 

Leu Leu Leu Phe Pro 
195 

CTA CCT TCT TTG ACT 

Leu Pro Ser Leu Thr 
210 

CCT GTT AGT TCT CAG 
Ser Gin 



AAT TCC 
Asn Ser 



TGT GTG GAC TGG 

Cys Val Asp Trp 
185 

ACT ATT GGT GAT 
Gly Asp 



Thr He 
200 



ATG CGT CGT ATG 

Met Arg Arg Met 
215 



CGA GAG 
Arg Glu 



CAC AGA TAT TCT ACA CCT 



Pro Val Ser 
225 

AAC ACC TCC 
Asn Thr Ser 



His Arg 
230 



Tyr Ser Thr Pro 
235 



His Cys 
175 



AGT AAC ATC AGA CAA 576 

Ser Asn He Arg Gin 
190 

AGT GGA GTC CCA GCA 624 

Ser Gly Val Pro Ala 
205 

TCT GTT TCC AGG ATG 672 

Ser Val Ser Arg Met 
220 

CAC GCC TTC ACC TTT 720 

His Ala Phe Thr Phe 
240 



AGT CCC TCA TCT 

Ser Pro Ser Ser 
245 



GAA GGT TCC CTC TCC CAG AGG CAG AGG 768 
Ser Gin Arg 



Glu Gly Ser Leu 
250 



Gin Arg 
255 



TCG ACA TCC ACA CCT AAT GTC CAC ATG GTC AGC ACC ACG CTG CCT GTG 816 
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Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 
260 265 270 

GAC AGC AGG ATG ATT GAG GAT GCA ATT CGA AGT CAC AGC GAA TCA GCC 864 

Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 285 

TCA CCT TCA GCC CTG TCC AGT AGC CCC AAC AAT CTG AGC CCA ACA GGC 912 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
290 295 300 

TGG TCA CAG CCG AAA ACC CCC GTG CCA GCA CAA AGA GAG CGG GCA CCA 960 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 315 320 

GTA TCT GGG ACC CAG GAG AAA AAC AAA ATT AGG CCT CGT GGA CAG AGA 1008 

Val Ser Gly Thr Gin Glu Lys Asn Lys He Arg Pro Arg Gly Gin Arg 
325 330 335 

GAT TCA AGC TAT TAT TGG GAA ATA GAA GCC AGT GAA GTG ATG CTG TCC 1056 

Asp Ser Ser Tyr Tyr Trp Glu He Glu Ala Ser Glu Val Met Leu Ser 
340 345 350 

ACT CGG ATT GGG TCA GGC TCT TTT GGA ACT GTT TAT AAG GGT AAA TGG 1104 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 360 365 

CAC GGA GAT GTT GCA GTA AAG ATC CTA AAG GTT GTC GAC CCA ACC CCA 1152 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 
370 375 380 

GAG CAA TTC CAG GCC TTC AGG AAT GAG GTG GCT GTT CTG CGC AAA ACA 1200 

Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 
385 390 395 400 

CGG CAT GTG AAC ATT CTG CTT TTC ATG GGG TAC ATG ACA AAG GAC AAC 1248 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 415 

CTG GCA ATT GTG ACC CAG TGG TGC GAG GGC AGC AGC CTC TAC AAA CAC 1296 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 
420 425 430 

CTG CAT GTC CAG GAG ACC AAG TTT CAG ATG TTC CAG CTA ATT GAC ATT 1344 
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Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 445 

GCC CGG CAG ACG GCT CAG GGA ATG GAC TAT TTG CAT GCA AAG AAC ATC 1392 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
450 455 460 

ATC CAT AGA GAC ATG AAA TCC AAC AAT ATA TTT CTC CAT GAA GGC TTA 1440 

lie His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 

ACA GTG AAA ATT GGA GAT TTT GGT TTG GCA ACA GTA AAG TCA CGC TGG 1488 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 H * D 

AGT GGT TCT CAG CAG GTT GAA CAA CCT ACT GGC TCT GTC CTC TGG ATG 1536 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 

500 505 a 

GCC CCA GAG GTG ATC CGA ATG CAG GAT AAC AAC CCA TTC AGT TTC CAG 1584 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 520 5Zb 

TCG GAT GTC TAC TCC TAT GGC ATC GTA TTG TAT GAA CTG ATG ACG GGG 1632 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 

535 * 



530 



GAG CTT CCT TAT TCT CAC ATC AAC AAC CGA GAT CAG ATC ATC TTC ATG 1680 
Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin He He Phe Met 



545 550 555 

GTG GGC CGA GGA TAT GCC TCC CCA GAT CTT AGT AAG CTA TAT AAG AAC 1728 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 570 5/3 

TGC CCC AAA GCA ATG AAG AGG CTG GTA GCT GAC TGT GTG AAG AAA GTA 1776 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 ^ 

AAG GAA GAG AGG CCT CTT TTT CCC CAG ATC CTG TCT TCC ATT GAG CTG 1824 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
J sgs 600 60d 

CTC CAA CAC TCT CTA CCG AAG ATC AAC CGG AGC GCT TCC GAG CCA TCC 1872 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 



WO 93/06248 



PCT/US92/07817 



-39- 

610 615 620 

TTG CAT CGG GCA GCC CAC ACT GAG GAT ATC AAT GCT TGC ACG CTG ACC 1920 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 

625 " 630 635 640 

ACG TCC CCG AGG CTG CCT GTC TTC TAG 1947 

Thr Ser Pro Arg Leu Pro Val Phe 
645 



(2) INFORMATION FOR SEQ ID NO: 12: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi} SEQUENCE DESCRIPTION: SEQ ID N0:12: 

Met Glu His He Gin Gly Ala Trp Lys Thr He Ser Asn Gly Phe Gly 
15 10 15 

Phe Lys Asp Ala Val Phe Asp Gly Ser Ser Cys He Ser Pro Thr He 
20 25 30 

Val Gin Gin Phe Gly Tyr Gin Arg Arg Ala Ser Asp Asp Gly Lys Leu 
35 * 40 45 

Thr Asp Pro Ser Lys Thr Ser Asn Thr He Arg Val Phe Leu Pro Asn 
50 • " 55 60 

Lys Gin Arg Thr Val Val Asn Val Arg Asn Gly Met Ser Leu His Asp 
65 70 75 80 

Cys Leu Met Lys Ala Leu Lys Val Arg Gly Leu Gin Pro Glu Cys Cys 
85 90 95 

Ala Val Phe Arg Leu Leu His Glu His Lys Gly Lys Lys Ala Arg Leu 
100 105 HO 

Asp Trp Asn Thr Asp Ala Ala Ser Leu He Gly Glu Glu Leu Gin Val 
115 120 125 

Asp Phe Leu Asp His Val Pro Leu Thr Thr His Asn Phe Ala Arg Lys 
130 135 140 

Thr Phe Leu Lys Leu Ala Phe Cys Asp He Cys Gin Lys Phe Leu Leu 
145 150 155 160 

Asn Gly Phe Arg Cys Gin Thr Cys Gly Tyr Lys Phe His Glu His Cys 
165 170 175 
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Ser Thr Lys Val Pro Thr Met Cys Val Asp Trp Ser Asn lie Arg Gin 
180 

Leu Leu Leu Phe Pro Asn Ser Thr He Gly Asp Ser Gly Val Pro Ala 
igS 200 C}J0 

Leu Pro Ser Leu Thr Met Arg Arg Met Arg Glu Ser Val Ser Arg Met 
210 215 ZZ° 

Pro Val Ser Ser Gin His Arg Tyr Ser Thr Pro His Ala Phe Thr Phe 
225 230 235 

Asn Thr Ser Ser Pro Ser Ser Glu Gly Ser Leu Ser Gin Arg Gin Arg 
245 " 

Ser Thr Ser Thr Pro Asn Val His Met Val Ser Thr Thr Leu Pro Val 

260 265 
Asp Ser Arg Met He Glu Asp Ala He Arg Ser His Ser Glu Ser Ala 
275 280 

Ser Pro Ser Ala Leu Ser Ser Ser Pro Asn Asn Leu Ser Pro Thr Gly 
2go 295 30U 

Trp Ser Gin Pro Lys Thr Pro Val Pro Ala Gin Arg Glu Arg Ala Pro 
305 310 

Val Ser Gly Thr Gin Glu Lys Asn Lys lie Arg Pro Arg Gly Gin Arg 
325 " u 

Asp Ser Ser Tyr Tyr Trp Glu lie Glu Ala Ser Glu Val Met Leu Ser 
340 345 

Thr Arg He Gly Ser Gly Ser Phe Gly Thr Val Tyr Lys Gly Lys Trp 
355 • 360 

His Gly Asp Val Ala Val Lys He Leu Lys Val Val Asp Pro Thr Pro 

370 375 380 

Glu Gin Phe Gin Ala Phe Arg Asn Glu Val Ala Val Leu Arg Lys Thr 

Arg His Val Asn He Leu Leu Phe Met Gly Tyr Met Thr Lys Asp Asn 
405 410 

Leu Ala He Val Thr Gin Trp Cys Glu Gly Ser Ser Leu Tyr Lys His 



Leu His Val Gin Glu Thr Lys Phe Gin Met Phe Gin Leu He Asp He 
435 440 HH3 

Ala Arg Gin Thr Ala Gin Gly Met Asp Tyr Leu His Ala Lys Asn He 
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Ile His Arg Asp Met Lys Ser Asn Asn He Phe Leu His Glu Gly Leu 
465 470 475 480 

Thr Val Lys He Gly Asp Phe Gly Leu Ala Thr Val Lys Ser Arg Trp 
485 490 495 

Ser Gly Ser Gin Gin Val Glu Gin Pro Thr Gly Ser Val Leu Trp Met 
500 505 510 

Ala Pro Glu Val He Arg Met Gin Asp Asn Asn Pro Phe Ser Phe Gin 
515 " 520 525 

Ser Asp Val Tyr Ser Tyr Gly He Val Leu Tyr Glu Leu Met Thr Gly 
530 535 540 

Glu Leu Pro Tyr Ser His He Asn Asn Arg Asp Gin He He Phe Met 
545 550 555 560 

Val Gly Arg Gly Tyr Ala Ser Pro Asp Leu Ser Lys Leu Tyr Lys Asn 
565 570 575 

Cys Pro Lys Ala Met Lys Arg Leu Val Ala Asp Cys Val Lys Lys Val 
580 585 590 

Lys Glu Glu Arg Pro Leu Phe Pro Gin He Leu Ser Ser He Glu Leu 
595 600 605 

Leu Gin His Ser Leu Pro Lys He Asn Arg Ser Ala Ser Glu Pro Ser 
610 615 620 

Leu His Arg Ala Ala His Thr Glu Asp He Asn Ala Cys Thr Leu Thr 
625 " 630 635 640 

Thr Ser Pro Arg Leu Pro Val Phe 
645 
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WHAT Ig rr.»TMKn Tf» : 

1 . A method of identifying an individual at an 
increased risk for developing cancer, comprising: 

amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying an individual having one or more 
mutations in said region as having an increased risk for 
developing cancer. 

2 The method according to claim 1 , wherein said 
region" encodes at least amino acids 514 to 535 of SEQ ID 
N0:12. 

3. The method according to claim 2, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO : 1 2 . 

4 The method according to claim 3, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO : 1 2 . 

5. The method according to claim 1, wherein said 
products are analyzed by DKA sequencing. 

6. The method according to claim 1, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR). 

7. The method according to claim 6, wherein said 
PCR employs a primer comprising SEQ ID. NO: 7 and a primer 
comprising SEQ ID NO: 8. 

8. A method for determining a prognosis in 
patients afflicted with cancer, comprising: 
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amplifying a region of the c-raf-1 gene of said 
individual; 

analyzing products of said amplification for 
evidence of mutation; and 

classifying a patient having no mutation in said 
region as being less likely to suffer disease relapse or 
having an increased chance of survival than a patient 
having one or more mutations in said region. 

9. The method according to claim 8, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO: 12* 

10. The method according to claim 9, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO: 12. 

11. The method according to claim 10, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO:12. 

12. The method according to claim 9, wherein said 
products are analyzed by DNA sequencing. 

13. The method according to claim 9, wherein said 
amplification is effected using polymerase chain 
reaction (PCR). 

14. The method according to claim 13, wherein said 
PCR employs a primer comprising SEQ ID NO: 7 and a primer 
comprising SEQ ID NO: 8. 

15. A method for determining the proper course of 
treatment for a patient afflicted with cancer, 
comprising: 

amplifying a region of the c-raf-1 gene of said 
patient; 
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analyzing products of said amplification for 
evidence of mutation; 

identifying a patient having at least one mutation 
in said region, which patient may require treatment 
proper for a patient having a lesser chance of survival 
or decreased time to relapse; and 

identifying a patient lacking mutations in said 
region, which patient may require treatment proper for a 
patient having a greater chance of survival or being 
less likely to suffer disease relapse. 

16. The method according to claim 15, wherein said 
region encodes at least amino acids 514 to 535 of SEQ ID 
NO:12. 

17. The method according to claim 16, wherein said 
region encodes at least amino acids 500 to 550 of SEQ ID 
NO: 12. 

18. The method according to claim 17, wherein said 
region encodes at least amino acids 450 to 630 of SEQ ID 
NO: 12. 

19. The method according to claim 16, wherein said 
products are analyzed by DNA sequencing. 

20. The method according to claim 16, wherein said 
amplification is effected using a polymerase chain 
reaction (PCR) . 

21- The method according to claim 20, wherein said 
PCR employs a primer comprising SEQ ID N0:7 and a primer 
comprising SEQ ID NO: 8. 
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Figure 3 
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Figure 4 
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Figure 6 



1. Polymerase Chain Reaction (PCR) Amplification of Target DNA 
Either Genomic DNA or cDNA 

Nucleotide 1307 of coding Nucleotide 1326 of coding 

Primer 1 s 5'- AGGAGACCAAGTTTCAGATG -3' 

Nucleotide 1915 of coding Nucleotide 1896 of coding 

i I 

Primer 2 = 5 1 - GCGTGCAAGCATTGATATCC -3* 
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